Abstract

The challenge of self-optimization for orthogonal frequency-division multiple-access (OFDMA) interference
channels is that users inherently compete harmfully and simultaneous water-filling (WF) would
lead to a Pareto-inefficient equilibrium. To overcome this, we first introduce the role of the environmental
interference derivative in the WF optimization of the interactive OFDMA game and then study the
environmental interference derivative properties of Stackelberg equilibrium (SE). Such properties provide
important insights to devise free OFDMA games for achieving various SEs, realizable by simultaneous
WF regulated by specifically chosen operational interference derivatives. We also present a definition of
all-Stackelberg-leader equilibrium (ASE) where users are all foresighted to each other, albeit each with
only local channel state information (CSI), and can thus most effectively reconcile their competition to
maximize the user rates. We show that under certain environmental conditions, the free games are both
unique and optimal. Simulation results reveal that our distributed ASE game achieves performance
very close to that of the near-optimal centralized iterative spectrum balancing (ISB) method in [5].
I. Introduction



Dynamic spectrum access enabled by cognitive radio technologies has recently attracted much attention
[1-3]. Of particular interest is the orthogonal frequency-division multiple-access (OFDMA) interference
channel model in which multiple communication links coexist and share a spectrum of frequency bands
dynamically without a centralized spectrum manager.¹ The model is motivated by its suitability to analyze
multicell networks and the move of regulatory bodies in favor of liberalization and license-exempt spectrum
policies for higher utilization. In interference channels, often infrastructureless, users are all uncoordinated
individuals, and inherently compete with each other. How individuals gather sufficient network information
and optimize themselves to benefit not only themselves but also the entire network is an open challenge.

In this regard, game theory has emerged as a unique methodology to analyze a group of self-organized 
mutually competing users and can be adopted to design autonomous access control methods for OFDMA, 
e.g., [7-12]. However, as reviewed in [13], there are fundamental deadlocks that prevent existing results 
from making substantial impacts. The first deadlock is the low efficiency of Nash equilibrium (NE) where 
users are myopic and harmfully compete with each other [14]. NE is also not unique and its performance
is unpredictable. To avoid over-competition, Stackelberg equilibrium (SE) arises where foresighted users or 
leaders can benefit more from the game, by knowing full CSI and the strategies of all other myopic users 
or followers. In [15], one-leader SE (OSE) for OFDMA was studied revealing a large gain over NE. 

Unfortunately, there are operational obstacles in achieving OSE. First, to qualify as a leader, the
user should possess tremendous cognition capability, although this could be learned from the environment
by conjectures [16].² Worse, obtaining the OSE would require playing a bi-level game,³ in which there is a
strict order of how the game proceeds to reach the designated equilibrium. For OFDMA of uncoordinated
users, this is unrealizable. Another long-standing issue is that OSE is not known to be unique.

The challenges do not end there, as there is a strong desire to extend the SE framework to the cases
where multiple leaders coexist [16-19]. In [18], DeMiguel et al. studied the extension and in their definition
of K-SE (K leaders out of all users), leaders are myopic with respect to other leaders, but foresighted to
followers. To approach the K-SE, the usual challenges in OSE remain. Further, according to their definition,
if all users are leaders, the game degenerates to an NE, losing the foresightedness advantages of the leaders,
making it inefficient for OFDMA systems. Most recently, K-SE was employed for femtocell communications
in [19] where strong assumptions on leaders, such as full CSI⁴ and exhaustive optimization, apply.

¹ For infrastructure-based communications where a centralized spectrum manager knowing full channel state information
(CSI) between every transmitter-receiver link is present, optimal approaches are well known [4-6], but they are hardly practical
for infrastructureless OFDMA systems if a large number of users are involved, let alone their susceptibility to CSI errors.

² Note that for OSE, while conjectural equilibrium (CE) can be used to make the leader learn the necessary network-wide
information, it was acknowledged in [16] that extending it to K-SE scenarios is a topic for future investigation.

³ To achieve OSE via a bi-level game, the leader will start his action first, wait for all the followers to play a sub-game to
reach an NE, then revise his action, and the whole process repeats until convergence. A bi-level game is needed even for CE.

The paper aims to achieve a truly self-optimized OFDMA network where every transmitter individually 
optimizes its own power and subcarrier allocation, based on its local CSI and interference observations, for 
maximizing the network rates. To this end, we first revisit the water-filling (WF) optimization of a user's 
parameters by introducing the role of interference derivative which measures the environmental changes 
with respect to a change in the power allocation strategy of that user. We show that the environmental 
interference derivative plays a crucial role in the uniqueness of the OFDMA game, and by analyzing the 
environmental interference derivative property of NE and OSE, we can construct a free OSE game that 
can be approached by simultaneous (or iterative) WF⁵ in an arbitrary order, regulated by an operational
interference derivative. As a final outcome, we present a new all-leader SE (ASE) definition under which
users are all foresighted to each other with their belief that others are myopic. By extending the analysis,
we devise a free ASE game that is achievable again by a regulated simultaneous WF interaction. We also
show that under certain environmental conditions, the proposed free games have unique and optimal
equilibriums. In the context of OFDMA, this paper makes the following major contributions:

• We provide a new sufficient condition for the uniqueness of NE. 

• We prove that OSE is unique and a free OSE game can be constructed. 

• We define a new ASE and propose a free ASE game which is unique under certain conditions. 

• The free games can be achieved by simultaneous generalized WF (GWF) empowered by the specifically
chosen operational interference derivatives, with the aid of only users' local CSI.

• Simulation results reveal that the proposed free ASE game achieves the sum-rate very close to that 
of the centralized iterative spectrum balancing (ISB) method for the OFDMA channel in [5]. 

• Our analysis provides the sufficient conditions for convergence to NE, OSE and ASE. 

⁴ In this paper, full CSI refers to the CSI knowledge of every transmitter-receiver link of the entire network, in contrast to
local CSI, which refers to the CSI only for a transmitter to its intended receiver.

⁵ It is worth pointing out that in our proposed scheme, the WF solution incorporates the operational interference derivative
and is fundamentally different from the conventional WF and the modified WF in [20].
II. Preliminaries



A. OFDMA Interference Channels 

Consider a $K$-user OFDMA infrastructureless system, as shown in Fig. 1, where each user is free to occupy
any of the $N$ orthogonal subcarriers for communications. The users operate in a non-cooperative manner
and inherently compete with each other. For user $k$, the total transmitted power is constrained by

$$\sum_{n=1}^{N} p_k[n] \le P_k, \quad \forall k \in \{1, 2, \dots, K\}, \qquad (1)$$

where $p_k[n]$ denotes the power allocated for the $n$th subchannel by user $k$ and $P_k$ denotes the maximum
total power for user $k$. We write $\mathbf{p}_k = \{p_k[1], p_k[2], \dots, p_k[N]\}$ as the power allocation pattern of user $k$,
which is drawn from some power allocation strategy $\mathcal{P}_k$, or denoted by $\mathbf{p}_k \in \mathcal{P}_k$.

Let $H_{ij}[n]$ denote the flat-fading channel coefficient from transmitter $i$ to receiver $j$ and $N_k[n]$ denote
the noise power density for the complex additive white Gaussian noise (AWGN) at receiver $k$ on the $n$th
subchannel. The achievable rate for user $k$ can therefore be given by

$$R_k = \sum_{n=1}^{N} \log_2\left(1 + \frac{p_k[n]}{\sigma_k[n] + I_k[n]}\right) = \sum_{n=1}^{N} \log_2\left(1 + \frac{p_k[n]}{c_k[n]}\right), \qquad (2)$$

where $\sigma_k[n] = \frac{N_k[n]}{|H_{kk}[n]|^2}$ is the normalized noise power on subchannel $n$, $I_k[n] = \sum_{i \ne k} p_i[n]\theta_{ik}[n]$ is the total
interference power on subchannel $n$ for user $k$, with $\theta_{ik}[n] = \frac{|H_{ik}[n]|^2}{|H_{kk}[n]|^2}$ denoting the normalized (by user
$k$) channel power gain from transmitter (or interferer) $i$ to receiver $k$, and $c_k[n] = \sigma_k[n] + I_k[n]$.
Transmitter $k$ possesses only the knowledge of local CSI, which corresponds to $\{c_k[n]\}$ for all $n$.
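
As a concrete illustration of the rate expression in (2), the following minimal Python sketch evaluates the interference $I_k[n]$, the level $c_k[n] = \sigma_k[n] + I_k[n]$ and the achievable rate for a toy two-user, two-subcarrier system. The array layout `theta[i][k][n]` (normalized gain from transmitter i to receiver k on subcarrier n), the function names and all numerical values are assumptions made only for this example.

```python
import math

def interference(p, theta, k, n):
    """I_k[n] = sum over i != k of p_i[n] * theta_ik[n]."""
    return sum(p[i][n] * theta[i][k][n] for i in range(len(p)) if i != k)

def rate(p, theta, sigma, k):
    """R_k = sum_n log2(1 + p_k[n] / c_k[n]), with c_k[n] = sigma_k[n] + I_k[n]."""
    total = 0.0
    for n in range(len(p[k])):
        c = sigma[k][n] + interference(p, theta, k, n)
        total += math.log2(1.0 + p[k][n] / c)
    return total

# Hypothetical toy system: K = 2 users, N = 2 subcarriers.
p = [[1.0, 0.0], [0.5, 0.5]]                      # p[k][n]
theta = [[[0.0, 0.0], [0.5, 0.5]],                # theta[0][k][n]
         [[0.5, 0.5], [0.0, 0.0]]]                # theta[1][k][n]
sigma = [[1.0, 1.0], [1.0, 1.0]]                  # sigma[k][n]

r0 = rate(p, theta, sigma, 0)
print(f"R_0 = {r0:.4f} bit per channel use")
```

Only the quantities $\{c_k[n]\}$ enter the rate of user $k$, which is why local CSI suffices for each user's own optimization.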

B. Input-Output Subsystem: Environmental Interference Derivative 

In this setup, users are all uncoordinated individuals and each user will allocate its power over the
subcarriers to maximize its own rate based on its observation of the environment and its belief on how the
environment would react to its action. The environment user $k$ observes can change because other users
may alter their strategies to respond to the environmental changes caused by its own action. As such, it is
a dynamic process where users all interact and could converge to a compromised equilibrium such as NE.
To model this interaction, from user $k$'s viewpoint, we can regard other users as a subsystem which
takes its power allocation at time $t$ as inputs (i.e., $\{p_k^t[n]\}$) and produces a new interference pattern at
time $t+1$ as outputs (i.e., $\{I_k^{t+1}[n]\}$), as shown in Fig. 2. Under this model, we define the environmental
interference derivative, $\varphi_k[n] = \frac{\partial I_k^{t+1}[n]}{\partial p_k^t[n]}$, which measures the environmental change on a subcarrier seen by
a user, caused by a change in the user's own power allocation on that subcarrier. This captures the essence
of the overall network response and is a key parameter that determines whether or not such user interaction
converges, and helps develop self-optimized algorithms for achieving desirable equilibriums (e.g., ASE).

C. GWF and Operational Interference Derivative 

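Let $\mathcal{P} = \{\mathcal{P}_1, \mathcal{P}_2, \dots, \mathcal{P}_K\}$ be the set of all users' power allocation strategies and $\mathbf{p}_k \in \mathcal{P}_k\ \forall k$. We define
$\mathbf{p}_{-k} = \{\mathbf{p}_1, \dots, \mathbf{p}_{k-1}, \mathbf{p}_{k+1}, \dots, \mathbf{p}_K\}$ embracing the power allocation of all users except user $k$. User $k$'s
power allocation can be optimized by solving the following rate maximization problem (with $t$ omitted):

$$\mathbf{p}_k^* = \arg\max_{\mathbf{p}_k} \sum_{n=1}^{N} \log_2\left(1 + \frac{p_k[n]}{c_k[n]}\right) \quad \text{s.t.} \quad \sum_{n=1}^{N} p_k[n] \le P_k, \qquad (3)$$

which can be solved by a Lagrangian multiplier formulation

$$\mathcal{L} = \sum_{n=1}^{N} \log_2\left(1 + \frac{p_k[n]}{c_k[n]}\right) - \lambda\left(\sum_{n=1}^{N} p_k[n] - P_k\right) \qquad (4)$$

with the Lagrange multiplier $\lambda$. To proceed, it can be easily shown that⁶ (up to the constant factor $1/\ln 2$, which can be absorbed into $\lambda$)

$$\frac{\partial \mathcal{L}}{\partial p_k[n]} = \frac{\varphi_k[n] + 1}{c_k[n] + p_k[n]} - \frac{\varphi_k[n]}{c_k[n]} - \lambda = \frac{1}{\eta_k[n] + p_k[n]} - \lambda, \qquad (5)$$

where

$$\eta_k[n] = \frac{c_k^2[n] + \varphi_k[n]\,p_k^2[n]}{c_k[n] - \varphi_k[n]\,p_k[n]} \qquad (6)$$

is the "interactive" network noise due to $p_k[n]$ through the environmental interference derivative.
The Karush-Kuhn-Tucker (KKT) conditions for optimality suggest a WF strategy

$$p_k[n] = (w_k - \eta_k[n])^+, \qquad (7)$$

⁶ The derivation can be done by recognizing that $\frac{\varphi+1}{c+p} - \frac{\varphi}{c} = \frac{c - \varphi p}{(c+p)c}$ and $\frac{(c+p)c}{c - \varphi p} = \frac{c^2 + \varphi p^2}{c - \varphi p} + p$.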
where $(a)^+ = \max(0, a)$ and $w_k = \frac{c_k^2[n] + \varphi_k[n]\,p_k^2[n]}{c_k[n] - \varphi_k[n]\,p_k[n]} + p_k[n]$, for $p_k[n] > 0$, is chosen to satisfy the user's
power constraint $\sum_n p_k[n] \le P_k$, and interpreted as the "water-level" of the solution. Incorporating the
interactive nature of the system, the optimal power allocation for user $k$ can be realized by



As a result, to use the above solution (8), the user needs to have a belief on how the environment would
react, i.e., an operational interference derivative, or a belief on $\varphi_k[n]$, denoted as $\tilde{\varphi}_k[n]$. For instance, NE is
defined such that at the equilibrium (with the corresponding parameters marked with $*$) it satisfies

$$R_k(\mathbf{p}_k^*, \mathbf{p}_{-k}^*) \ge R_k(\mathbf{p}_k, \mathbf{p}_{-k}^*) \quad \forall k. \qquad (9)$$

Since the rate maximization for an NE user is performed with respect to some fixed interference pattern,
$\{I_k(\mathbf{p}_{-k}^*)[n]\}$, each user has a belief that the environment will remain static following a change of its power
allocation strategy. In other words, $\tilde{\varphi}_k[n] = 0$, leading to the well-known simultaneous WF [14]

$$p_k^*[n] = (w_k^* - c_k^*[n])^+ \quad \forall k. \qquad (10)$$

The operational interference derivative is the key by which a user's foresightedness can be nurtured. By
a proper choice of the operational interference derivative it is possible to create a free OFDMA game for
approaching a more desirable equilibrium, which we will address next. It is worth emphasizing that the
GWF in (7) is fundamentally different from the modified WF in [20]. The main difference is that in our
GWF there is an interactiveness through $\eta_k[n]$ which does not exist in the WF in [20].

III. SE and Its Free Game Realization

A. New Key Results of NE

Depending on how users react to the environmental changes, the system may converge to an equilibrium.
Throughout, we use the superscript $*$ to denote the parameters at the equilibrium so the overall strategy
is $\mathbf{P}^* = \{\mathbf{p}_k^*, \mathbf{p}_{-k}^*\}$. NE systems are Pareto-inefficient [7-10,14], which has motivated the concept of SE
in which there can be foresighted users who can maximize their rates more effectively by knowing other
myopic users' actions. In light of this, this paper aims to devise self-optimization methods that can lead
the OFDMA game to an SE. To do so, we first develop some new results of NE and consider the scenario
where user $\kappa$ is the user of interest and other users, for $i \ne \kappa$, are at NE, with user $i$'s power allocation
denoted by $\mathrm{NE}_i(\mathbf{p}_\kappa) = \{\mathrm{NE}_i(\mathbf{p}_\kappa)[1], \dots, \mathrm{NE}_i(\mathbf{p}_\kappa)[N]\}$, which is a function of user $\kappa$'s power allocation.
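
For readers who wish to experiment with the NE baseline, the simultaneous WF in (10) can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions: the water-level is found by bisection, all users update at the same time from a uniform starting allocation, and the function names, the `theta[i][k][n]` layout and the numerical values are hypothetical choices rather than anything prescribed in the paper.

```python
def water_fill(c, P, iters=100):
    """Classical WF: p[n] = max(0, w - c[n]) with sum_n p[n] = P (bisection on w)."""
    lo, hi = min(c), min(c) + P          # total power is 0 at lo and at least P at hi
    for _ in range(iters):
        w = 0.5 * (lo + hi)
        if sum(max(0.0, w - cn) for cn in c) > P:
            hi = w
        else:
            lo = w
    w = 0.5 * (lo + hi)
    return [max(0.0, w - cn) for cn in c]

def simultaneous_wf(theta, sigma, P, rounds=200):
    """All users re-run WF against the interference observed in the previous round."""
    K, N = len(sigma), len(sigma[0])
    p = [[P[k] / N] * N for k in range(K)]
    for _ in range(rounds):
        c = [[sigma[k][n] + sum(p[i][n] * theta[i][k][n] for i in range(K) if i != k)
              for n in range(N)] for k in range(K)]
        p = [water_fill(c[k], P[k]) for k in range(K)]
    return p

# Hypothetical weak-interference example: K = 2, N = 2, cross gains 0.1.
theta = [[[0.0, 0.0], [0.1, 0.1]],
         [[0.1, 0.1], [0.0, 0.0]]]
sigma = [[1.0, 2.0], [2.0, 1.0]]
P = [1.0, 1.0]
p_ne = simultaneous_wf(theta, sigma, P)
print(p_ne)
```

In this toy setting each user ends up on its own less noisy subcarrier; with stronger cross gains the same loop need not converge, which is the kind of behavior the convergence analysis later in the paper addresses.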

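Lemma 1 The rate of change of the power allocation for user $i$ with respect to that for user $\kappa$ is given by

$$\frac{\partial \mathrm{NE}_i(\mathbf{p}_\kappa)[n]}{\partial p_\kappa[n]} = -\theta_{\kappa i}[n]\,\mathrm{sgn}(p_i[n]), \qquad (11)$$

where $\mathrm{sgn}(x)$ returns one if $x > 0$ or zero if $x \le 0$.

Proof: First, assume that $p_i[n] > 0$. Then, by chain rule, we have

$$\frac{\partial \mathrm{NE}_i(\mathbf{p}_\kappa)[n]}{\partial p_\kappa[n]} = \frac{\partial \mathrm{NE}_i(\mathbf{p}_\kappa)[n]}{\partial I_i[n]} \cdot \frac{\partial I_i(\mathbf{p}_\kappa)[n]}{\partial p_\kappa[n]}. \qquad (12)$$

As user $i$ is using an NE strategy, we have, from (10), that $p_i[n] = \mathrm{NE}_i(\mathbf{p}_\kappa)[n] = w_i - \sigma_i[n] - I_i[n] > 0$ and
therefore, $\frac{\partial \mathrm{NE}_i(\mathbf{p}_\kappa)[n]}{\partial I_i[n]} = -1$. On the other hand, by definition, we have $\frac{\partial I_i(\mathbf{p}_\kappa)[n]}{\partial p_\kappa[n]} = \theta_{\kappa i}[n]$. Finally, note that
if $p_i[n] = 0$, the rate of change will be zero. Therefore, (11) is obtained, which completes the proof. □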

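Lemma 2 The environmental interference derivative for an NE subsystem is given by⁷

$$\frac{\partial I_\kappa(\mathrm{NE}_{-\kappa}(\mathbf{p}_\kappa))[n]}{\partial p_\kappa[n]} = -\sum_{i \ne \kappa} \theta_{\kappa i}[n]\,\theta_{i\kappa}[n]\,\mathrm{sgn}(p_i[n]) \le 0, \qquad (13)$$

where $\mathrm{NE}_{-\kappa}(\mathbf{p}_\kappa) \triangleq \{\mathrm{NE}_1(\mathbf{p}_\kappa), \dots, \mathrm{NE}_{\kappa-1}(\mathbf{p}_\kappa), \mathrm{NE}_{\kappa+1}(\mathbf{p}_\kappa), \dots, \mathrm{NE}_K(\mathbf{p}_\kappa)\}$.

Proof: Noting that $I_\kappa[n] = \sum_{i \ne \kappa} p_i[n]\theta_{i\kappa}[n] = \sum_{i \ne \kappa} \mathrm{NE}_i(\mathbf{p}_\kappa)[n]\,\theta_{i\kappa}[n]$, we have

$$\frac{\partial I_\kappa(\mathrm{NE}_{-\kappa}(\mathbf{p}_\kappa))[n]}{\partial p_\kappa[n]} = \frac{\partial \left(\sum_{i \ne \kappa} \mathrm{NE}_i(\mathbf{p}_\kappa)[n]\,\theta_{i\kappa}[n]\right)}{\partial p_\kappa[n]} = \sum_{i \ne \kappa} \frac{\partial \mathrm{NE}_i(\mathbf{p}_\kappa)[n]}{\partial p_\kappa[n]}\,\theta_{i\kappa}[n]. \qquad (14)$$

Then, the final result can be obtained by substituting the result of Lemma 1. □

⁷ The notation $I_\kappa(\mathrm{NE}_{-\kappa}(\mathbf{p}_\kappa))[n]$ illustrates that the interference seen by user $\kappa$, $I_\kappa[n]$, is a result of the power allocation
from other users, $\mathrm{NE}_{-\kappa}$, which is caused by the power allocation of user $\kappa$, $\mathbf{p}_\kappa$.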



Theorem 1 NE is unique in the OFDMA game if the environmental interference derivative satisfies

$$\varphi_k[n] = \frac{\partial I_k(\mathrm{NE}_{-k}(\mathbf{p}_k))[n]}{\partial p_k[n]} > -1, \quad \text{for all } \mathbf{p}_k \text{ and for all } k, n. \qquad (15)$$

Proof: To prove the result, it suffices to show that $w_k^*$ is unique for all $k$, and we are not interested in
those $p_k^*[n] = 0$ because they do not cause interference to others nor consume any power budget.
At NE, for $p_k^*[n] > 0$, we have

$$w_k^* = p_k^*[n] + c_k^*[n]. \qquad (16)$$

Then, the relationship between the water-level and the optimal power allocation can be studied by
differentiating $w_k^*$ with respect to $p_k^*[n]$ which gives $\frac{\partial w_k^*}{\partial p_k^*[n]} = 1 + \varphi_k[n]$. If (15) is true, or $\varphi_k[n] > -1$, then
$\frac{\partial w_k^*}{\partial p_k^*[n]} > 0$. In other words, if we have two water-levels such that $w_k < \bar{w}_k$, then their respective power
allocations will satisfy $\sum_n p_k[n] < \sum_n \bar{p}_k[n]$. Hence, there is a unique $w_k^*$ such that $\sum_n p_k^*[n] = P_k$. □

Corollary 1 Denoting $\theta = \max_{i \ne k} \theta_{ki}$, a sufficient condition for the uniqueness of NE is

$$\theta < \frac{1}{\sqrt{K-1}}. \qquad (17)$$

Proof: If (17) is true, then $1 > (K-1)\theta^2 \ge \sum_{i \ne k} \theta_{ki}[n]\,\theta_{ik}[n]\,\mathrm{sgn}(p_i[n])$, which implies that $\varphi_k[n] =
-\sum_{i \ne k} \theta_{ki}[n]\,\theta_{ik}[n]\,\mathrm{sgn}(p_i[n]) > -1$. As a result, according to Theorem 1, NE is unique. □
Theorem 1 provides a new way for analyzing the uniqueness of the equilibrium of an OFDMA game
through the environmental interference derivative. On the other hand, compared with the literature requiring
$\theta < \frac{1}{K-1}$ [7,23-25], Corollary 1 provides a less stringent sufficient condition for the uniqueness of NE.
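
To make the comparison tangible, the short Python sketch below evaluates the bound of Corollary 1, read here as $\theta < 1/\sqrt{K-1}$, next to the stricter condition from the literature, assumed here to read $\theta < 1/(K-1)$. The function names and the example values are illustrative assumptions only.

```python
import math

def ne_unique_corollary1(theta_max, K):
    """Sufficient condition of Corollary 1: theta < 1 / sqrt(K - 1)."""
    return theta_max < 1.0 / math.sqrt(K - 1)

def ne_unique_literature(theta_max, K):
    """Stricter condition assumed from the literature: theta < 1 / (K - 1)."""
    return theta_max < 1.0 / (K - 1)

# Hypothetical example: 5 users with a maximum normalized cross gain of 0.4.
K, theta_max = 5, 0.4
print(ne_unique_corollary1(theta_max, K), ne_unique_literature(theta_max, K))
```

For $K = 5$ the two thresholds are 0.5 and 0.25, so a cross gain of 0.4 is covered by Corollary 1 but not by the stricter condition; for $K = 2$ the two conditions coincide.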

B. OSE and A Free Game Implementation 

Definition 1 For OSE where user $\kappa$ is a foresighted user and the rest are myopic followers, we have

$$R_\kappa(\mathbf{p}_\kappa^*, \mathrm{NE}_{-\kappa}(\mathbf{p}_\kappa^*)) \ge R_\kappa(\mathbf{p}_\kappa, \mathrm{NE}_{-\kappa}(\mathbf{p}_\kappa)),$$
$$R_k(\mathbf{p}_k^*, \mathbf{p}_{-k}^*(\mathbf{p}_\kappa^*)) \ge R_k(\mathbf{p}_k, \mathbf{p}_{-k}^*(\mathbf{p}_\kappa^*)) \quad \forall k \ne \kappa. \qquad (18)$$

The OSE above, as defined in the conventional fashion, is by nature a bi-level game for the foresighted
user $\kappa$ because the action user $\kappa$ takes is expecting an NE response over the rest of the users, as seen in
(18). Clearly, the non-Stackelberg users ($k \ne \kappa$) are myopic and play the game as in NE. In the following,
we will study the environmental property of OSE and use it to create a free game version of OSE.



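Theorem 2 Given the foresighted user $\kappa$ in OSE, we have

$$-\frac{c_\kappa^*[n]}{2c_\kappa^*[n] + p_\kappa^*[n]} \le \varphi_\kappa^*[n] \le 0. \qquad (19)$$

Proof: Given that at OSE, we have $\mathbf{p}_\kappa^*$, we now consider a power allocation strategy, $\bar{\mathbf{p}}_\kappa$, such that
$\bar{p}_\kappa[\ell] = p_\kappa^*[\ell]\ \forall \ell \ne m, n$, but $\bar{p}_\kappa[n] = p_\kappa^*[n] + \Delta p$ and $\bar{p}_\kappa[m] = p_\kappa^*[m] - \Delta p$ for some $1 \le m \ne n \le N$ and
small $\Delta p > 0$. Denote the interference patterns for $\mathbf{p}_\kappa^*$ and $\bar{\mathbf{p}}_\kappa$, respectively, as $\mathbf{I}_\kappa^*$ and $\bar{\mathbf{I}}_\kappa$. Using Lemma 2,
we can write $\bar{I}_\kappa[n] = I_\kappa^*[n] - \Delta I[n]$ and $\bar{I}_\kappa[m] = I_\kappa^*[m] + \Delta I[m]$ for some small $\Delta I[n]$ and $\Delta I[m] \ge 0$. As
user $\kappa$'s rate is maximized by $\mathbf{p}_\kappa^*$ at OSE, we therefore have $R_\kappa(\bar{\mathbf{p}}_\kappa) - R_\kappa(\mathbf{p}_\kappa^*) \le 0$ implying

$$\log_2\left(1 + \frac{p_\kappa^*[n] + \Delta p}{\sigma_\kappa[n] + I_\kappa^*[n] - \Delta I[n]}\right) - \log_2\left(1 + \frac{p_\kappa^*[n]}{\sigma_\kappa[n] + I_\kappa^*[n]}\right)$$
$$+ \log_2\left(1 + \frac{p_\kappa^*[m] - \Delta p}{\sigma_\kappa[m] + I_\kappa^*[m] + \Delta I[m]}\right) - \log_2\left(1 + \frac{p_\kappa^*[m]}{\sigma_\kappa[m] + I_\kappa^*[m]}\right) \le 0. \qquad (20)$$

This can further be simplified to

$$\left(\frac{\sigma_\kappa[n] + I_\kappa^*[n] - \Delta I[n] + p_\kappa^*[n] + \Delta p}{\sigma_\kappa[n] + I_\kappa^*[n] - \Delta I[n]}\right)\left(\frac{\sigma_\kappa[n] + I_\kappa^*[n]}{\sigma_\kappa[n] + I_\kappa^*[n] + p_\kappa^*[n]}\right)$$
$$\times \left(\frac{\sigma_\kappa[m] + I_\kappa^*[m] + \Delta I[m] + p_\kappa^*[m] - \Delta p}{\sigma_\kappa[m] + I_\kappa^*[m] + \Delta I[m]}\right)\left(\frac{\sigma_\kappa[m] + I_\kappa^*[m]}{\sigma_\kappa[m] + I_\kappa^*[m] + p_\kappa^*[m]}\right) \le 1. \qquad (21)$$

Now, using (21) and noting that it is valid also after swapping $m$ and $n$ because subchannel indices $m$ and
$n$ are arbitrary, it can be easily shown after some manipulations that

$$\left(1 - \frac{(\Delta I[n] - \Delta p)^2}{(\sigma_\kappa[n] + I_\kappa^*[n] + p_\kappa^*[n])^2}\right)\left(1 - \frac{(\Delta I[m] - \Delta p)^2}{(\sigma_\kappa[m] + I_\kappa^*[m] + p_\kappa^*[m])^2}\right)$$
$$\le \left(1 - \frac{\Delta I[n]^2}{(\sigma_\kappa[n] + I_\kappa^*[n])^2}\right)\left(1 - \frac{\Delta I[m]^2}{(\sigma_\kappa[m] + I_\kappa^*[m])^2}\right) \quad \forall m, n, \qquad (22)$$

which yields

$$\frac{|\Delta p - \Delta I[n]|}{\sigma_\kappa[n] + I_\kappa^*[n] + p_\kappa^*[n]} \ge \frac{\Delta I[n]}{\underbrace{\sigma_\kappa[n] + I_\kappa^*[n]}_{c_\kappa^*[n]}} \ \Rightarrow\ \frac{\Delta I[n]}{\Delta p} \le \frac{c_\kappa^*[n]}{2c_\kappa^*[n] + p_\kappa^*[n]} \quad \forall n. \qquad (23)$$

Taking the limit $\Delta p \to 0$ and knowing that $\varphi_\kappa^*[n] \le 0$ (Lemma 2), we get the desired result of (19). □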

Theorem 3 As long as the environmental interference derivative, $\varphi_\kappa[n]$, exists, OSE is unique.

Proof: From our GWF analysis, for $p_\kappa^*[n] > 0$, we have $w_\kappa^* = \eta_\kappa^*[n] + p_\kappa^*[n]$ which gives

$$w = \frac{c^2 + \varphi p^2}{c - \varphi p} + p, \qquad (24)$$

where the indices $\kappa$ and $n$ as well as the superscript $*$ are omitted for convenience. In [16, Proposition 1],
it is known that in OSE, $\varphi$ is a constant and hence $\frac{\partial \varphi}{\partial p} = 0$. Then, $\frac{\partial w}{\partial p}$ can be derived as

$$\frac{\partial w}{\partial p} = \frac{(c - \varphi p)(2c\varphi + 2p\varphi) - (c^2 + \varphi p^2)(\varphi - \varphi)}{(c - \varphi p)^2} + 1 \qquad (25)$$
$$= \frac{2c\varphi + \varphi p + c}{c - \varphi p}. \qquad (26)$$

Because $\varphi \le 0$ (Lemma 2), we have $c - \varphi p > 0$. From Theorem 2, we also have $\varphi \ge -\frac{c}{2c + p}$. Therefore,

$$\frac{\partial w}{\partial p} \ge 0. \qquad (27)$$

Using a similar argument as in the proof of Theorem 1, $w^*$ is unique and so is OSE. □
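
The closed form in (26) can be sanity-checked numerically. The Python sketch below is an illustration under stated assumptions: it treats $\varphi$ as a constant (as in the proof), models the environment locally as $c(p) = c_0 + \varphi\,(p - p_0)$, and compares a central finite difference of the water-level in (24) with the expression $(2c\varphi + \varphi p + c)/(c - \varphi p)$. The numerical values are arbitrary.

```python
def water_level(p, c0, p0, phi):
    """w(p) from (24), with the assumed local model c(p) = c0 + phi * (p - p0)."""
    c = c0 + phi * (p - p0)
    return (c * c + phi * p * p) / (c - phi * p) + p

def dw_dp_closed_form(p, c, phi):
    """Closed form read from (26): (2*c*phi + phi*p + c) / (c - phi*p)."""
    return (2.0 * c * phi + phi * p + c) / (c - phi * p)

# Arbitrary operating point; phi lies above the bound -c/(2c+p) of Theorem 2.
c0, p0, phi = 1.5, 0.8, -0.3
h = 1e-6
numeric = (water_level(p0 + h, c0, p0, phi) - water_level(p0 - h, c0, p0, phi)) / (2.0 * h)
closed = dw_dp_closed_form(p0, c0, phi)
print(numeric, closed)
```

Lowering `phi` below $-c_0/(2c_0 + p_0)$ makes the closed form negative, which is the situation Theorem 2 rules out at OSE.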
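Also, from our GWF formulation in Section II-C, we know that $\frac{\partial R_\kappa}{\partial p_\kappa^*[n]} = \frac{1}{w_\kappa^*} = \lambda$, which further gives

$$\frac{\partial^2 R_\kappa}{\partial (p_\kappa^*[n])^2} = -\left(\frac{1}{w_\kappa^*}\right)^2 \frac{\partial w_\kappa^*}{\partial p_\kappa^*[n]}. \qquad (28)$$

From the proof of Theorem 3, we already know that $\frac{\partial w_\kappa^*}{\partial p_\kappa^*[n]} \ge 0$. As a consequence, $\frac{\partial^2 R_\kappa}{\partial (p_\kappa^*[n])^2} \le 0$,
which is a sufficient condition for optimality of $p_\kappa^*[n]$ for maximizing user $\kappa$'s achievable rate.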

The environmental interference derivative property of OSE in Theorem 2 is key to developing a free
OSE game for self-optimization of OFDMA. To do so, we first create an artificial OFDMA game where
user $\kappa$ is foresighted and other users are myopic, and the environmental interference derivative, $\tilde{\varphi}$, meets

$$\tilde{\varphi}_\kappa^*[n] = -\frac{c_\kappa^*[n]}{2c_\kappa^*[n] + p_\kappa^*[n]} \le \varphi_\kappa^*[n] \le 0. \qquad (29)$$
The above artificial game will approach the bi-level OSE game if $\frac{p_\kappa^*[n]}{c_\kappa^*[n]}$ is large for all $n$ such that $p_\kappa^*[n] > 0$
because $\tilde{\varphi}_\kappa^*[n] \to \varphi_\kappa^*[n]$. In this case, the optimal power allocation for both games will be identical. We
conjecture that for > 1, there will be a strong tendency that foresighted user k will own the subcarriers 
that it chooses to occupy and in so doing on those chosen subcarriers $\frac{p_\kappa^*[n]}{c_\kappa^*[n]}$ will be large and as a result the
artificial game provides an accurate representation of the original bi-level OSE game. 

Inspired by this, we propose to use the GWF by choosing the operational interference derivative (i.e.,
the belief of the environmental interference derivative) for foresighted user $\kappa$ as (see also Section II-C)

$$\tilde{\varphi}_\kappa^*[n] = -\frac{c_\kappa^*[n]}{2c_\kappa^*[n] + p_\kappa^*[n]}, \quad \text{or operationally,} \quad \tilde{\varphi}_\kappa^t[n] = -\frac{c_\kappa^t[n]}{2c_\kappa^t[n] + p_\kappa^{t-1}[n]}. \qquad (30)$$



Therefore, we have the following free game for approaching the OSE. 

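Proposition 1 A free OSE game lets users iteratively and simultaneously optimize their strategies using
GWF with user $\kappa$ choosing (30) and other myopic users choosing $\tilde{\varphi}_k[n] = 0\ \forall k \ne \kappa$ so that

$$p_\kappa^t[n] = \left(w_\kappa - \frac{(c_\kappa^t[n])^2 + \tilde{\varphi}_\kappa^t[n]\,(p_\kappa^{t-1}[n])^2}{c_\kappa^t[n] - \tilde{\varphi}_\kappa^t[n]\,p_\kappa^{t-1}[n]}\right)^+, \quad \text{where } \tilde{\varphi}_\kappa^t[n] \text{ is given in (30)},$$
$$p_k^t[n] = (w_k - c_k^t[n])^+, \quad \text{for } k \ne \kappa. \qquad (31)$$

With the operational interference derivative (30), user $\kappa$ in this free game does not need to wait for an
NE response of the rest of the users while adapting its power allocation strategy, and the equilibrium of
the free game can be achieved by simultaneous GWF in any arbitrary order.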
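
The free OSE game can be prototyped along the following lines. This Python sketch is illustrative only and rests on several assumptions: the interactive noise in (6) is evaluated with the previous power $p^{t-1}[n]$, the leader's belief follows the operational form in (30), the water-level is found by bisection, and all users update simultaneously from a uniform start. The helper names, the `theta[i][k][n]` layout and the numbers are hypothetical.

```python
def oper_deriv_ose(c, p_prev):
    """Operational interference derivative of the leader, as read from (30)."""
    return -c / (2.0 * c + p_prev)

def interactive_noise(c, p_prev, phi):
    """Interactive network noise (6), evaluated at the previous power."""
    return (c * c + phi * p_prev * p_prev) / (c - phi * p_prev)

def gwf(c, p_prev, phi, P, iters=100):
    """Generalized WF: p[n] = max(0, w - eta[n]) with sum_n p[n] = P (bisection on w)."""
    eta = [interactive_noise(cn, pn, fn) for cn, pn, fn in zip(c, p_prev, phi)]
    lo, hi = min(eta), min(eta) + P
    for _ in range(iters):
        w = 0.5 * (lo + hi)
        if sum(max(0.0, w - e) for e in eta) > P:
            hi = w
        else:
            lo = w
    w = 0.5 * (lo + hi)
    return [max(0.0, w - e) for e in eta]

def free_ose_game(theta, sigma, P, leader=0, rounds=100):
    """Simultaneous GWF: the leader uses (30), all other users use phi = 0."""
    K, N = len(sigma), len(sigma[0])
    p = [[P[k] / N] * N for k in range(K)]
    for _ in range(rounds):
        c = [[sigma[k][n] + sum(p[i][n] * theta[i][k][n] for i in range(K) if i != k)
              for n in range(N)] for k in range(K)]
        new_p = []
        for k in range(K):
            if k == leader:
                phi = [oper_deriv_ose(c[k][n], p[k][n]) for n in range(N)]
            else:
                phi = [0.0] * N          # myopic belief: static environment
            new_p.append(gwf(c[k], p[k], phi, P[k]))
        p = new_p
    return p

# Hypothetical toy system: K = 2, N = 2, cross gains 0.1, user 0 as leader.
theta = [[[0.0, 0.0], [0.1, 0.1]],
         [[0.1, 0.1], [0.0, 0.0]]]
sigma = [[1.0, 2.0], [2.0, 1.0]]
P = [1.0, 1.0]
p_ose = free_ose_game(theta, sigma, P, leader=0)
print(p_ose)
```

With `phi = 0` the interactive noise reduces to $c$, so followers perform conventional WF, while the leader's belief lowers its effective noise to $c - p/2$ on the subcarriers it already occupies.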

Theorem 4 If $\varphi_\kappa^*[n] = \frac{\partial I_\kappa^*[n]}{\partial p_\kappa^*[n]} > -0.5$, the free game has a unique equilibrium and the GWF in (31)
achieves the unique equilibrium which is also the global optimum for maximizing user $\kappa$'s rate.

Proof: With the operational interference derivative (30) in GWF, we have, for $p_\kappa^*[n] > 0$, that

$$w_\kappa^* = \frac{(c_\kappa^*[n])^2 + c_\kappa^*[n]\,p_\kappa^*[n]}{c_\kappa^*[n] - \tilde{\varphi}_\kappa^*[n]\,p_\kappa^*[n]} = c_\kappa^*[n] + \frac{p_\kappa^*[n]}{2}. \qquad (32)$$

Therefore, if $\varphi_\kappa^*[n] > -0.5$, then we get

$$\frac{\partial w_\kappa^*}{\partial p_\kappa^*[n]} = \varphi_\kappa^*[n] + \frac{1}{2} > 0. \qquad (33)$$

Since $\frac{\partial w_\kappa^*}{\partial p_\kappa^*[n]} > 0$ implies that $w_\kappa^*$ is unique, the equilibrium of the free game is unique. For the same reason,
as shown in (28) previously, we have $\frac{\partial^2 R_\kappa}{\partial (p_\kappa^*[n])^2} < 0$ and therefore, $p_\kappa^*[n]$ obtained by the GWF in (31) will
converge to the unique equilibrium and is the global optimum under the condition $\varphi_\kappa^*[n] > -0.5$. □
Unlike the bi-level game, in the free game, $\varphi_\kappa[n] = \frac{\partial I_\kappa[n]}{\partial p_\kappa[n]} \ne \frac{\partial I_\kappa(\mathrm{NE}_{-\kappa}(\mathbf{p}_\kappa))[n]}{\partial p_\kappa[n]}$ because the interference
seen by user $\kappa$ is no longer the outcome of the NE response of all the myopic users. Also, within the free
game, it does not seem possible to know the exact value for $\varphi_\kappa[n] = \frac{\partial I_\kappa[n]}{\partial p_\kappa[n]}$. However, the way the free
game is carried out is based on the belief that the interference derivative has the property (31) which is
the property that permits the free game to approach the bi-level OSE. In this belief, $\varphi_\kappa[n] > -0.5$.

C. ASE, Its Variants and A Free Game Implementation 
Definition 2 Users can all be completely foresighted and in this ASE, we have

$$R_k(\mathbf{p}_k^*, \mathrm{ASE}_{-k}(\mathbf{p}_k^*)) \ge R_k(\mathbf{p}_k, \mathrm{ASE}_{-k}(\mathbf{p}_k)) \quad \forall k, \qquad (34)$$

where $\mathrm{ASE}_{-k}(\cdot)$ corresponds to the ASE strategies from all other users except user $k$ and is defined similarly
as $\mathrm{NE}_{-k}(\cdot)$ used previously. Under this definition, ASE is a multilevel game with $(K-1)$ levels.

Note that (34) is the ultimate definition for ASE with complete foresightedness for all users. However, 
under (34), users' strategies are strongly coupled together and it is not known if this is achievable, if not
impossible. This partly explains why in [18] foresighted leaders are not supposed to be foresighted with 
each other and as a consequence in the case of ASE, users become myopic, losing their foresightedness 
capability. Motivated by our OSE results, instead, one could define ASE in the following way. 

Definition 3 Users are all foresighted with the belief that others are myopic. In this ASE, we have

$$R_k(\mathbf{p}_k^*, \mathrm{NE}_{-k}(\mathbf{p}_k^*)) \ge R_k(\mathbf{p}_k, \mathrm{NE}_{-k}(\mathbf{p}_k)) \quad \forall k, \qquad (35)$$

where $\mathrm{NE}_{-k}$ indicates that user $k$ believes that other users will come to an NE in response to its action.

In (35), although users are not foresighted to each other, they are foresighted to some belief of the 
overall network response. In particular, every user has the belief that all other users are myopic and give 
an NE response to its action. This version of ASE is a natural extension to OSE and can be interpreted
as having K interfered OSE games. The main issue of (35) is that the construction of the game is limited
to "foresighted-myopic" interaction and the OSE games will interfere with each other, whereas in ASE one 
would expect to facilitate "foresighted-foresighted" interaction, which is the major challenge. To nurture 
the required "foresighted-foresighted" exchange, we have the following theorem. 

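Theorem 5 The environmental interference derivatives for ASE, $\{\varphi_k^*[n]\}$, satisfy

$$\varphi_k^*[n] = \sum_{i \ne k} \frac{\theta_{ki}[n]\,\theta_{ik}[n]\,\mathrm{sgn}(p_i^*[n])}{\varphi_i^*[n]}. \qquad (36)$$

Proof: First, by definition, we have

$$\varphi_k^*[n] = \frac{\partial I_k^*[n]}{\partial p_k^*[n]} = \sum_{i \ne k} \theta_{ik}[n]\,\frac{\partial p_i^*[n]}{\partial p_k^*[n]} = \sum_{i \ne k} \theta_{ik}[n]\,\frac{\partial p_i^*[n]}{\partial I_i^*[n]}\,\frac{\partial I_i^*[n]}{\partial p_k^*[n]}, \qquad (37)$$

in which $\frac{\partial p_i^*[n]}{\partial I_i^*[n]} = \left(\frac{\partial I_i^*[n]}{\partial p_i^*[n]}\right)^{-1} = (\varphi_i^*[n])^{-1}$, and $\frac{\partial I_i^*[n]}{\partial p_k^*[n]} = \theta_{ki}[n]$. As such, the property (36) is obtained, where
$\mathrm{sgn}(p_i^*[n])$ accounts for the fact that only users active on subcarrier $n$ respond. □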

Theorem 5 provides a key property of ASE which facilitates the exchanges of foresightedness between 
users. With this result, Definition 3 in (35) can go beyond a group of interfered OSEs to become a group 
of foresighted OSEs. We will show that it is possible to gain insight from the property (36) so that 
self-optimization OFDMA can be achieved. Before we address this, the following lemmas are useful. 

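Lemma 3 With user $k$ believing that other users ($\ell \ne k$) are myopic, we have

$$\sum_{\ell \ne k} \theta_{k\ell}[n]\,\theta_{\ell k}[n]\,\mathrm{sgn}(p_\ell^*[n]) \le \frac{c_k^*[n]}{2c_k^*[n] + p_k^*[n]}. \qquad (38)$$

Proof: Based on the belief of user $k$, we start by writing $\frac{\Delta I_k^*[n]}{\Delta p_k^*[n]}$ as

$$\frac{\Delta I_k^*[n]}{\Delta p_k^*[n]} \overset{(a)}{=} \sum_{\ell \ne k} \theta_{\ell k}[n]\,\frac{\Delta p_\ell^*[n]}{\Delta I_\ell^*[n]}\,\frac{\Delta I_\ell^*[n]}{\Delta p_k^*[n]}\,\mathrm{sgn}(p_\ell^*[n]) \overset{(b)}{\ge} -\frac{c_k^*[n]}{2c_k^*[n] + p_k^*[n]}, \qquad (39)$$

where (a) is by definition and (b) is due to (19) (true as long as the rate at the equilibrium is the highest).
Now, knowing $\frac{\Delta p_\ell^*[n]}{\Delta I_\ell^*[n]} = -1$ (as in the proof of Lemma 1) and $\frac{\Delta I_\ell^*[n]}{\Delta p_k^*[n]} = \theta_{k\ell}[n]$ yields the result (38). □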

Proposition 2 For a self-optimized OFDMA network where each user knows only its local CSI, the
environmental interference derivative for user $k$ to achieve ASE (Definition 3) should satisfy

$$\varphi_k^*[n] \ge -\sqrt{\frac{c_k^*[n]}{2c_k^*[n] + p_k^*[n]}} \quad \forall k, n. \qquad (40)$$

Proof: Note that (36) is true only if $p_k^*[n] > 0$ because otherwise the subchannel is not used and
$\varphi_k^*[n] = 0$. Therefore, more precisely, if $\theta_{ij}[n] = 0$ for $i = j$, then we have

$$\sum_{k=1}^{K} \varphi_k^*[n] \overset{(a)}{=} \sum_{k=1}^{K} \frac{\mathrm{sgn}(p_k^*[n])}{\varphi_k^*[n]} \sum_{\ell=1}^{K} \theta_{k\ell}[n]\,\theta_{\ell k}[n]\,\mathrm{sgn}(p_\ell^*[n]) \qquad (41a)$$
$$\overset{(b)}{\ge} \sum_{k=1}^{K} \frac{\mathrm{sgn}(p_k^*[n])}{\varphi_k^*[n]} \left(\frac{c_k^*[n]}{2c_k^*[n] + p_k^*[n]}\right), \qquad (41b)$$

where (a) is due to summing both sides of (36) over all $k$ and (b) is due to the result in Lemma 3 together with
$\varphi_k^*[n] \le 0$. With self-optimization in mind that each user should operate based on only its local CSI
observation, i.e., $\{c_k[n]\}$ for user $k$, the above property can be enforced by comparing term-by-term on both
sides giving rise to

$$\varphi_k^*[n] \ge \frac{1}{\varphi_k^*[n]} \left(\frac{c_k^*[n]}{2c_k^*[n] + p_k^*[n]}\right) \ \Rightarrow\ -\sqrt{\frac{c_k^*[n]}{2c_k^*[n] + p_k^*[n]}} \le \varphi_k^*[n] \le 0. \qquad (42)$$

Note that the negative is chosen to give (40) because $\varphi_k^*[n] \le 0$. □
It is worth emphasizing that (41b) and hence (40) is a weaker environmental interference derivative 
property for ASE which is a result deduced from the original property (36) but satisfying (40) does not 
necessarily imply (36). Nonetheless, it does bring the possibility of network foresightedness based on local 
channel observation and will therefore make self-optimization realizable by the following proposition. 

Proposition 3 A free ASE game can be constructed by simultaneous GWF such that

$$p_k^t[n] = \left(w_k - \frac{(c_k^t[n])^2 + \tilde{\varphi}_k^t[n]\,(p_k^{t-1}[n])^2}{c_k^t[n] - \tilde{\varphi}_k^t[n]\,p_k^{t-1}[n]}\right)^+, \quad \text{where } \tilde{\varphi}_k^t[n] = -\sqrt{\frac{c_k^t[n]}{2c_k^t[n] + p_k^{t-1}[n]}}. \qquad (43)$$

The free ASE game proposed above follows the same rationale behind the free OSE game described
before. In particular, the operational interference derivative for carrying out GWF is chosen to satisfy the
environmental interference derivative property of ASE (40) with equality on the basis of approaching the
original multilevel ASE at high SNRs. Furthermore, being a free game, the equilibrium can be achieved
by simultaneous GWF in any order. The following theorem addresses the uniqueness of the free game.

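Theorem 6 If $\varphi_k^*[n] > -\frac{c_k^*[n]}{4c_k^*[n] + p_k^*[n]}\ \forall k, n$ such that $p_k^*[n] > 0$, the free ASE game has a unique equilibrium
and the GWF in (43) achieves the unique equilibrium and is also optimal for rate maximization.

Proof: To begin, we find it useful to define $\gamma_k^*[n] = \sqrt{2 + \frac{p_k^*[n]}{c_k^*[n]}}$ so that $\tilde{\varphi}_k^*[n] = -\frac{1}{\gamma_k^*[n]}$ due to the
GWF in (43) and $p_k^*[n] = (\gamma_k^*[n])^2 c_k^*[n] - 2c_k^*[n]$. Then, for those $k, n$ such that $p_k^*[n] > 0$, we have

$$w_k^*[n] = \frac{(c_k^*[n])^2 + c_k^*[n]\,p_k^*[n]}{c_k^*[n] - \tilde{\varphi}_k^*[n]\,p_k^*[n]} = \frac{c_k^*[n]\,\gamma_k^*[n]\,(\gamma_k^*[n] + 1)}{\gamma_k^*[n] + 2}. \qquad (44)$$

To see how $w_k^*[n]$ varies with respect to $p_k^*[n]$, we first obtain

$$\frac{\partial \gamma_k^*[n]}{\partial p_k^*[n]} = \frac{1}{2\gamma_k^*[n]} \left(\frac{1}{c_k^*[n]} - \frac{\varphi_k^*[n]\,p_k^*[n]}{(c_k^*[n])^2}\right) > 0. \qquad (45)$$

Therefore, $\gamma_k^*[n]$ increases as $p_k^*[n]$ increases. On the other hand, $\frac{\partial}{\partial \gamma_k^*[n]}\left(\frac{\gamma_k^*[n] + 1}{\gamma_k^*[n] + 2}\right) = \frac{1}{(\gamma_k^*[n] + 2)^2} > 0$ and hence if
$p_k^*[n]$ increases, then $\gamma_k^*[n]$ will increase and therefore $\frac{\gamma_k^*[n] + 1}{\gamma_k^*[n] + 2}$ will increase accordingly.

It remains to show that $c_k^*[n]\gamma_k^*[n]$ is also increasing with $p_k^*[n]$. To do so, we note that $c_k^*[n]\gamma_k^*[n] =
\sqrt{2(c_k^*[n])^2 + p_k^*[n]\,c_k^*[n]}$. As a consequence, if $\varphi_k^*[n] > -\frac{c_k^*[n]}{4c_k^*[n] + p_k^*[n]}$ is satisfied, we obtain

$$\frac{\partial\, c_k^*[n]\gamma_k^*[n]}{\partial p_k^*[n]} = \frac{1}{2}\,\frac{1}{c_k^*[n]\gamma_k^*[n]} \left[(4c_k^*[n] + p_k^*[n])\,\varphi_k^*[n] + c_k^*[n]\right] > 0. \qquad (46)$$

As a result, under the aforementioned condition, $w_k^*[n]$ is increasing with $p_k^*[n]$ and hence the power
constraint $\sum_n p_k^*[n]$. Following the same argument as before, $w_k^*$ and hence the free ASE game is unique.
From (28), the equilibrium is also optimal in maximizing the user's rate, which completes the proof. □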
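
The algebra behind the proof of Theorem 6 can be checked numerically. The Python sketch below is a hedged illustration: it assumes the operational derivative of (43) reads $-\sqrt{c/(2c+p)}$, that $\gamma = \sqrt{2 + p/c}$, and that the uniqueness threshold is $-c/(4c+p)$, which is what the sign condition in (46) implies. The function names and test values are hypothetical.

```python
import math

def gamma(c, p):
    """gamma = sqrt(2 + p / c), as defined in the proof of Theorem 6."""
    return math.sqrt(2.0 + p / c)

def oper_deriv_ase(c, p):
    """Assumed operational interference derivative for the free ASE game (43)."""
    return -math.sqrt(c / (2.0 * c + p))

def water_level_ase(c, p):
    """Water-level w = c (c + p) / (c - phi p) with phi from oper_deriv_ase."""
    phi = oper_deriv_ase(c, p)
    return c * (c + p) / (c - phi * p)

def water_level_closed_form(c, p):
    """Closed form (44): c * gamma * (gamma + 1) / (gamma + 2)."""
    g = gamma(c, p)
    return c * g * (g + 1.0) / (g + 2.0)

def uniqueness_condition(phi_env, c, p):
    """Assumed condition of Theorem 6: phi_env > -c / (4c + p)."""
    return phi_env > -c / (4.0 * c + p)

c, p = 2.0, 1.0
print(water_level_ase(c, p), water_level_closed_form(c, p))
print(uniqueness_condition(-0.1, c, p), uniqueness_condition(-0.3, c, p))
```

The two water-level expressions agree to machine precision, and for $c = 2$, $p = 1$ the threshold is $-2/9$, so an environmental derivative of $-0.1$ satisfies the condition while $-0.3$ does not.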

IV. Convergence Analysis 

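The power-interference game can be generally analyzed by recognizing that

$$\Delta p_k^t = \rho_k^t\,\Delta I_k^t, \quad \text{for some } \rho_k^t \le 0 \text{ (observe and action; GWF reoptimization)},$$
$$\Delta I_k^{t+1} = \varphi_k^t\,\Delta p_k^t, \quad \text{for some } \varphi_k^t \le 0 \text{ (action and reaction; environmental response)}, \qquad (47)$$

where the subcarrier index $n$ is omitted for convenience. In what follows, we can write

$$\Delta p_k^{t+1} = \rho_k^{t+1}\,\Delta I_k^{t+1} \qquad (48)$$
$$= \rho_k^{t+1} \sum_{i \ne k} \Delta p_i^t\,\theta_{ik}. \qquad (49)$$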



Convergence can take place in two possible cases: (i) $|\Delta p_k^{t+1}| < |\Delta p_k^t|$ for all $t > t_0$ for some $t_0 > 0$,
and (ii) $|\Delta p_k^{t+1}| > |\Delta p_k^t|$ for all $t > t_0$ for some $t_0 > 0$, the latter of which belongs to the case of strong
interference channel. To proceed, we define $|\overline{\Delta p}^t| = \max_k |\Delta p_k^t|$, $|\underline{\Delta p}^t| = \min_k |\Delta p_k^t|$, $|\overline{\rho}^t| = \max_k |\rho_k^t|$,
$|\underline{\rho}^t| = \min_k |\rho_k^t|$, $\overline{\theta} = \max_{i \ne j} \theta_{ij}$ and $\underline{\theta} = \min_{i \ne j} \theta_{ij}$. Our objective here is to illustrate that it is
possible to obtain sufficient conditions for convergence on the channel parameters $\overline{\theta}$ and $\underline{\theta}$.

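To have (i), we start by expressing

$$|\overline{\Delta p}^{t+1}| \overset{(a)}{=} \max_k \Big| \rho_k^{t+1} \sum_{i \neq k} \theta_{ik}\,\Delta p_i^t \Big| \le |\overline{\rho}^{t+1}|\,|\overline{\Delta p}^t|\,\overline{\theta}\,(K-1), \tag{50}$$

where (a) is due to (49), and the overlined quantities are the maxima (over users, or over crosstalk links for $\overline{\theta}$) defined above. Therefore, a sufficient condition for convergence is to have

$$|\overline{\rho}^{t+1}|\,\overline{\theta}\,(K-1) < 1 \;\Longrightarrow\; \overline{\theta} < \frac{1}{(K-1)\,|\overline{\rho}^{t+1}|}, \tag{51}$$

because this will guarantee $|\Delta p_k^{t+1}| < |\overline{\Delta p}^t|$ $\forall k$, leading to $\Delta p_k^{\infty} = 0$ $\forall k$.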
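The contraction argument behind the first sufficient condition can be illustrated numerically. The sketch below is only an illustration under stated assumptions: it iterates a linearized update of the form $\Delta p_k^{t+1} = \rho_k \sum_{i \neq k} \theta_{ik} \Delta p_i^t$ with a constant $\rho_k$, and the function names, the indexing `theta[i][k]`, and the numerical values are all hypothetical choices rather than anything taken from the paper.

```python
def step(dp, rho, theta):
    """One linearized update: dp_k <- rho_k * sum over i != k of theta[i][k] * dp_i."""
    K = len(dp)
    return [rho[k] * sum(theta[i][k] * dp[i] for i in range(K) if i != k)
            for k in range(K)]


def max_abs_trajectory(dp0, rho, theta, iters):
    """Track max_k |dp_k| over the iterations (the overlined quantity in the text)."""
    dp = list(dp0)
    traj = [max(abs(v) for v in dp)]
    for _ in range(iters):
        dp = step(dp, rho, theta)
        traj.append(max(abs(v) for v in dp))
    return traj


K = 3
rho = [-1.0] * K        # assumed NE-like response with |rho| = 1
theta_bar = 0.4         # assumed crosstalk, below 1 / ((K - 1) * |rho|) = 0.5
theta = [[0.0 if i == k else theta_bar for k in range(K)] for i in range(K)]

traj = max_abs_trajectory([1.0, -0.5, 0.25], rho, theta, iters=20)
print(traj[0], traj[-1])  # the largest power change shrinks toward zero
```

With these values the per-step contraction factor is at most $|\rho|\,\overline{\theta}\,(K-1) = 0.8$, so the largest power change decays geometrically.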

On the other hand, for (ii) $|\Delta p_k^{t+1}| > |\Delta p_k^t|$, the process will converge as well since $|\Delta p_k^t|$ is finite due to the total power constraint $P_k$. This can be interpreted as the scenario where a user, say $k$, decides to allocate all its power to given subcarriers (if $\Delta p_k^t > 0$) or to withdraw all its power (if $\Delta p_k^t < 0$). The former illustrates the case in which user $k$ wants to own the subcarriers and other users will have a strong tendency to leave the subcarriers due to the severe interference caused by user $k$. A natural consequence is that those subcarriers will be occupied by user $k$ only, and hence $\varphi_k = 0$, resulting in the strong interference channel studied in [21,22]. To obtain a sufficient condition for such convergence, we consider



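$$|\Delta p_k^{t+1}| = \Big| \rho_k^{t+1} \sum_{i \neq k} \theta_{ik}\,\Delta p_i^t \Big| \ge |\underline{\rho}^{t+1}|\,|\underline{\Delta p}^t|\,\underline{\theta} \quad \forall k, \tag{52}$$

where we have used the fact that $\{\Delta p_i^t\}_{\forall i}$ are all of the same sign (all positive or all negative), and the underlined quantities are the minima defined above. As such, it will converge to a strong interference channel if

$$|\underline{\rho}^{t+1}|\,\underline{\theta} > 1 \;\Longrightarrow\; \underline{\theta} > \frac{1}{|\underline{\rho}^{t+1}|}. \tag{53}$$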



Fig. 3 illustrates the convergence regions based on the two sufficient conditions above. 
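As a rough aid for reading the two sufficient conditions, the following sketch classifies a given crosstalk matrix and set of response slopes into the region of (51), the region of (53), or neither. It is a simplification under stated assumptions: the slopes are treated as fixed numbers (in the analysis they vary with the iteration index), they are assumed nonzero, and the function name and return labels are hypothetical.

```python
def convergence_region(theta, rho):
    """Classify (theta, rho) against the two sufficient conditions.

    theta: K x K nested list of crosstalk gains (diagonal entries are ignored).
    rho:   list of K nonzero response slopes, assumed constant over iterations.
    """
    K = len(theta)
    off_diag = [abs(theta[i][j]) for i in range(K) for j in range(K) if i != j]
    theta_max, theta_min = max(off_diag), min(off_diag)
    rho_max = max(abs(r) for r in rho)
    rho_min = min(abs(r) for r in rho)
    if theta_max < 1.0 / ((K - 1) * rho_max):
        return "region 1"   # condition (51): shrinking power changes
    if theta_min > 1.0 / rho_min:
        return "region 2"   # condition (53): strong interference, orthogonal users
    return "undetermined"   # neither sufficient condition holds


def uniform(K, value):
    """Helper: K x K matrix with `value` off the diagonal and 0 on it."""
    return [[0.0 if i == j else value for j in range(K)] for i in range(K)]


print(convergence_region(uniform(2, 0.5), [-1.0, -1.0]))        # region 1
print(convergence_region(uniform(2, 2.0), [-1.0, -1.0]))        # region 2
print(convergence_region(uniform(3, 0.7), [-1.0] * 3))          # undetermined
print(convergence_region(uniform(3, 0.4), [-30.0] * 3))         # region 2
```

The last call shows the effect of a large slope magnitude: the second region then extends down to very small crosstalk values.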

Based on the convergence regions described above, the convergence behaviors of the various equilibria can be analyzed by understanding $\rho_k^t$. In particular, we know that for $p_k^t > 0$, we have



t _ a p{ _ A K - vlY 

Pk 



A 4 _ _ A 4 

Acf 



As a result, we have the following lemma for evaluating p\ of NE, OSE and ASE. 
Lemma 4 Based on (54), it can be derived that 



(54) 



\pI\ = 



A vl 



Act 



pI 1 



*M = (NE), 
■t ~t -* 



t k f _ t ( OSE with user k as leader), 

t-1 



h - E=T when is large (ASE). 



(55) 



Proof: In this proof, for convenience, we will omit the subcarrier index $n$, the user index $k$ and the iteration index $t$. Also, we note that $|\rho| = \left|\frac{\Delta \eta}{\Delta c}\right| = \left|\frac{d\eta}{dc}\right|$. For NE, $\eta = c$ and therefore $|\rho| = 1$.
In the case of OSE, using (6) and the corresponding definition of $\varphi$ in OSE, we have



c 2 + 



( 2c+p) 



C + 



= C "2 



\P\ = 1. 



(56) 



We start our analysis for the case of ASE by expressing 



dr, = c 2 - 2c£p + eg p 2 - yp 2 + c 2 f p 
dc (c — (/3p) 2 



(57) 



On the other hand, we can also examine $\frac{d\varphi}{dc}$ to obtain



dip 



P 



dc 2${2c + p) 2 c 2 



(58) 
Substituting this result into (57) then gives

dv = c 2 - 2c£p + \^ - £p 2 + |gV g 
dc (c — (pp) 2 

Now, for p > c or large f, we have <p = ~ S J 2 £^ ~ ■ Consequently, |p| = §2 « ^<p2 = This 

completes the proof, and knowing $|\rho|$ will be useful in understanding the convergence behavior. □

For NE, $|\rho_k^t| = 1$ and a sufficient condition for convergence is $\overline{\theta} < \frac{1}{K-1}$ on the largest crosstalk gain, as reported in [7,23-25]. Also, our analysis shows that $\underline{\theta} > 1$ on the smallest crosstalk gain will ensure convergence as well, and in this case a given subcarrier will be occupied by one user only, which aligns with the result for the Gaussian strong interference channel in [21,22]. Since $|\rho_k^t| = 1$ for OSE, it has the same convergence behavior as NE.

For ASE, as opposed to NE and OSE, $|\rho_k^t|$ can take very different values depending on $p_k^t / c_k^t$. In particular, if $p_k^t / c_k^t$ is large, then $|\rho_k^t|$ will be very large and therefore convergence will tend to occur in the second region, which then reduces to roughly $\underline{\theta} > 0$, in which case, according to our analysis, users at the equilibrium will be made orthogonal.



V. Simulation Results 

In this section, we provide simulation results of the sum-rates achieved by the proposed algorithms. The iterative spectrum balancing (ISB) method in [5], which is a near-optimal centralized algorithm, is used as a performance upper bound. First, we consider an example 2-user 2-subcarrier network (i.e., $K = 2$, $N = 2$) [15] where $\sigma_1[1] = \sigma_2[2] = 4$, $\sigma_1[2] = \sigma_2[1] = 1$, $\theta_{12}[1] = \theta_{12}[2] = \theta_{21}[1] = \theta_{21}[2] = 0.5$ and $P_1 = P_2 = 10$, so that

$$R_1 = R_1[1] + R_1[2] = \log_2\!\left(1 + \frac{p_1[1]}{4 + 0.5\,p_2[1]}\right) + \log_2\!\left(1 + \frac{p_1[2]}{1 + 0.5\,p_2[2]}\right),$$

$$R_2 = R_2[1] + R_2[2] = \log_2\!\left(1 + \frac{p_2[1]}{1 + 0.5\,p_1[1]}\right) + \log_2\!\left(1 + \frac{p_2[2]}{4 + 0.5\,p_1[2]}\right).$$

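The solutions for NE, OSE and ASE (due to Definition 2 in (34)) can be easily derived analytically as

$$\text{NE:}\ \begin{cases} p_1[1] = 2,\\ p_1[2] = 8,\\ p_2[1] = 8,\\ p_2[2] = 2, \end{cases} \qquad \text{OSE:}\ \begin{cases} p_1[1] = 0,\\ p_1[2] = 10,\\ p_2[1] = 9,\\ p_2[2] = 1, \end{cases} \qquad \text{and ASE:}\ \begin{cases} p_1[1] = 0,\\ p_1[2] = 10,\\ p_2[1] = 10,\\ p_2[2] = 0. \end{cases} \tag{61}$$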
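The NE entries and the follower's part of the OSE entries can be cross-checked with plain water-filling. The sketch below is a minimal illustration, not the paper's GWF algorithm: it assumes classical water-filling over the noise-plus-interference levels $\sigma_k[n] + 0.5\,p_j[n]$ of the 2-user example, a simultaneous update of both users, and hypothetical function and variable names.

```python
def water_fill(floor, budget):
    """Classical water-filling: p[n] = max(0, mu - floor[n]) with sum(p) == budget."""
    order = sorted(range(len(floor)), key=lambda n: floor[n])
    mu = floor[order[0]]
    # Try the largest active set first and shrink it until the water level
    # mu lies above every floor in the set.
    for m in range(len(order), 0, -1):
        active = order[:m]
        mu = (budget + sum(floor[n] for n in active)) / m
        if mu > floor[active[-1]]:
            break
    return [max(0.0, mu - f) for f in floor]


SIGMA = [[4.0, 1.0], [1.0, 4.0]]  # SIGMA[k][n]: noise level of user k on subcarrier n
THETA = 0.5                       # common crosstalk gain of the example
BUDGET = [10.0, 10.0]             # total power of each user


def best_response(k, p_other):
    """Water-filling response of user k to the other user's power allocation."""
    floor = [SIGMA[k][n] + THETA * p_other[n] for n in range(2)]
    return water_fill(floor, BUDGET[k])


def simultaneous_wf(iters=100):
    """Both users re-optimize at once, starting from a uniform allocation."""
    p = [[5.0, 5.0], [5.0, 5.0]]
    for _ in range(iters):
        p = [best_response(0, p[1]), best_response(1, p[0])]
    return p


ne = simultaneous_wf()
follower = best_response(1, [0.0, 10.0])  # user 2 reacting to p_1 = (0, 10)
print(ne)        # approaches [[2, 8], [8, 2]]
print(follower)  # [9.0, 1.0]
```

In this example the deviation from the NE halves at every simultaneous update, which is consistent with the crosstalk gain of 0.5 lying below the two-user bound $1/(K-1) = 1$.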
The simulation results for this channel are given in Table 1, where it is observed that the proposed iterative GWF algorithm can be used to take users to the equilibria of interest. In particular, ASE tends to put the users into orthogonal subchannels and achieves the highest user rates.
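The rates in Table 1 can be reproduced from the analytical power allocations in (61). The short sketch below evaluates $R_k = \sum_n \log_2\big(1 + p_k[n] / (\sigma_k[n] + 0.5\,p_j[n])\big)$ for the 2-user example; the variable and function names are hypothetical.

```python
from math import log2

SIGMA = [[4.0, 1.0], [1.0, 4.0]]  # SIGMA[k][n]: noise level of user k on subcarrier n
THETA = 0.5                       # common crosstalk gain of the example


def user_rates(p):
    """Return [R_1, R_2] for the allocation p[k][n] of the 2-user 2-subcarrier example."""
    rates = []
    for k in range(2):
        j = 1 - k  # index of the interfering user
        rates.append(sum(
            log2(1.0 + p[k][n] / (SIGMA[k][n] + THETA * p[j][n]))
            for n in range(2)
        ))
    return rates


equilibria = {
    "NE": [[2.0, 8.0], [8.0, 2.0]],
    "OSE": [[0.0, 10.0], [9.0, 1.0]],
    "ASE": [[0.0, 10.0], [10.0, 0.0]],
}
for name, p in equilibria.items():
    r1, r2 = user_rates(p)
    print(f"{name}: R1 = {r1:.4f}, R2 = {r2:.4f}")
```

The computed values agree with the simulated rates in Table 1 to within the rounding of the table.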

Next, we provide simulation results averaged over a large number of independent channel realizations. In the simulations, we model each subcarrier channel by an equal-power four-ray Rayleigh fading channel (see Fig. 4) [26], i.e., $|H_{ij}[n]|^2 = |h_{ij}^{(1)}[n]|^2 + |h_{ij}^{(2)}[n]|^2 + |h_{ij}^{(3)}[n]|^2 + |h_{ij}^{(4)}[n]|^2$, $\forall i,j$, and we assume that $\mathrm{E}[|H_{ij}[n]|^2] = x$ and $\mathrm{E}[|h_{ij}^{(\ell)}[n]|^2] = 0.25x$ $\forall \ell$, for $i \neq j$. For $i = j$, we set $\mathrm{E}[|H_{kk}[n]|^2] = 1$ and $\mathrm{E}[|h_{kk}^{(\ell)}[n]|^2] = 0.25$ $\forall \ell$. Hence, $x$ is the parameter that measures the relative severity of the interference channel.
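One way to draw channel gains with these statistics is sketched below. It assumes each ray is an independent zero-mean circularly symmetric complex Gaussian (so that its magnitude is Rayleigh distributed) and that subcarriers are drawn independently; both are assumptions of this illustration, and the function names are hypothetical.

```python
import random


def four_ray_gain(mean_gain, rng):
    """Draw |H|^2 as the sum of four equal-power Rayleigh rays with E[|H|^2] = mean_gain."""
    ray_power = mean_gain / 4.0        # E[|h|^2] of each ray
    std = (ray_power / 2.0) ** 0.5     # std of the real and imaginary parts
    return sum(rng.gauss(0.0, std) ** 2 + rng.gauss(0.0, std) ** 2
               for _ in range(4))


def draw_channel(K, N, x, rng):
    """Return gains[i][j][n]: mean 1 on direct links (i == j), mean x on cross links."""
    return [[[four_ray_gain(1.0 if i == j else x, rng) for _ in range(N)]
             for j in range(K)]
            for i in range(K)]


rng = random.Random(0)
gains = draw_channel(K=3, N=9, x=0.4, rng=rng)
print(gains[0][0][0], gains[0][1][0])  # one direct-link and one cross-link sample
```

Averaging many such draws recovers the specified means, so varying `x` scales the cross links relative to the unit-mean direct links.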

Table 2 provides results showing the likelihood of convergence for different games with $P_k = 100$ $\forall k$ and $N_k[n] = 0.01$ $\forall k, n$. As shown, there is a percentage of cases where NE and OSE diverge, and this becomes more problematic as the number of users increases. In contrast, ASE always converges except when $x$ is small, meaning that the channel has only very weak crosstalk. This is because for small $x$, noise becomes the dominant effect, which does not react to the players' actions, but users in ASE tend to mistakenly regard the overall noise as interference due to other users. As such, ASE diverges in a small percentage of cases (less than 5%). In terms of convergence speed, nevertheless, ASE generally takes longer to converge than NE and OSE (results not shown here due to space).
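The divergence counts can be converted to percentages of the 1000 runs to check the "less than 5%" statement. The snippet below uses only counts that appear explicitly in Table 2; the dictionary layout and names are hypothetical.

```python
RUNS = 1000  # independent simulations per table entry

# Divergence counts per (K, N) configuration, copied from Table 2.
divergences = {
    "ASE, x = 0.2": {"(3,9)": 6, "(4,16)": 39, "(5,15)": 48},
    "NE, x = 0.8": {"(3,9)": 78, "(4,16)": 310, "(5,15)": 386},
    "OSE, x = 0.8": {"(3,9)": 58, "(4,16)": 251, "(5,15)": 323},
}

percent = {
    row: {cfg: 100.0 * count / RUNS for cfg, count in cols.items()}
    for row, cols in divergences.items()
}

for row, cols in percent.items():
    print(row, {cfg: f"{value:.1f}%" for cfg, value in cols.items()})
```

For ASE at $x = 0.2$ the largest value is 4.8%, whereas NE at $x = 0.8$ reaches 38.6% for the (5,15) configuration.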

In Table 3, results are provided for the average users' sum-rates for a 3-user 9-subcarrier interference channel, with $P_k = 100$ $\forall k$, $N_k[n] = 0.01$ $\forall k, n$ and $x = 0.4$. Results show that there is a considerable gain in the sum-rates using ASE over NE and OSE. The cumulative distribution functions for the sum-rate ratios between ASE and NE users and between OSE and NE users are given in Fig. 5. Results illustrate that for ASE it is possible to have a sum-rate 5 times greater than what is achieved by NE. Furthermore, we compare the performance of ASE and ISB [5]. Such results are given in Table 4, where we assumed $P_k = 100$ $\forall k$, $N_k[n] = 0.01$ $\forall k, n$ and $x = 0.4$. Results show that the average per-user sum-rate for ASE is about 90% of that for ISB and does not decrease much when the number of users/subcarriers increases.
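The "about 90%" figure and the gain over NE follow directly from the entries of Tables 3 and 4. The snippet below performs this arithmetic; the dictionary names are hypothetical.

```python
# Average per-user sum-rates from Table 4, keyed by (K, N).
ase = {"(3,9)": 32.8414, "(6,18)": 32.2201, "(12,36)": 31.6874}
isb = {"(3,9)": 34.6674, "(6,18)": 35.5500, "(12,36)": 35.6335}
ase_to_isb = {cfg: ase[cfg] / isb[cfg] for cfg in ase}

# Average users' sum-rates from Table 3 (users 1 to 3).
table3 = {
    "NE": [13.9944, 13.8966, 13.8021],
    "OSE": [16.5275, 15.5493, 15.4743],
    "ASE": [32.6641, 32.8518, 33.0399],
}
totals = {name: sum(rates) for name, rates in table3.items()}
gain_over_ne = {name: totals[name] / totals["NE"] for name in totals}

print({cfg: round(r, 3) for cfg, r in ase_to_isb.items()})
print({name: round(g, 3) for name, g in gain_over_ne.items()})
```

The ASE-to-ISB ratio falls from roughly 0.95 to 0.89 as the network grows, and the average system sum-rate of ASE in Table 3 is about 2.4 times that of NE.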

To provide a more complete comparison between NE, OSE, ASE and ISB, in Fig. 6, we provide the average system sum-rate results for various $x$ (which varies the severity of interference between users). A 3-user 9-subcarrier channel with $P_k = 100$ $\forall k$, $N_k[n] = 0.01$ $\forall k, n$ and $x$ from 0.001 to 10 is considered. We discuss the results by looking into three separate regions: (1) $x < 0.03$, (2) $0.03 < x < 4$ and (3) $x > 4$. In Region (1), $x$ is so small that the channel is reduced to $K$ parallel single-user OFDMA channels. In this case, users do not interfere with each other and NE, OSE and ASE all achieve sum-rate performance close to the centralized ISB, with ASE being slightly inferior to NE and OSE. Basically, all the game-theoretic approaches operate under the assumption that the players' strategies affect the interference patterns the users see, but for small $x$ this is no longer true. For this reason, as ASE users are the most aggressive players, they are slightly inferior to NE and OSE users. From our convergence analysis in Section IV, it is understood that the superior performance of ASE is due to the possibility of having a large $|\rho_k|$, leading to convergence in region 2 in Fig. 3. Using (55), for ASE, we can roughly estimate



Therefore, for ASE to work well, we need to have $\theta > \frac{1}{|\rho|} \approx 0.035$ (the dominant sufficient condition for ASE to converge), which agrees very well with the threshold of $x$ for Region (1).

Region (2) covers the most typical scenarios of interference channels. In this region, the performance 
differences between ISB and game-theoretic equilibria become much more significant. In particular, the 
sum-rates of NE and OSE are significantly compromised, while ASE is able to achieve the sum-rate close 
to that of ISB. Finally, in Region (3), NE, OSE and ASE all appear to converge to the same performance 
which is also very close to that for ISB. This is because when x is very large (i.e., the interference links are 
much stronger than the desired links), this forces the gaming strategies to completely avoid users sharing 
any subcarriers [21,22]. Consequently, they all perform equally and as well as ISB. 



VI. Conclusion

This paper studied the competition properties of an OFDMA channel. By introducing the environmental and operational interference derivatives, we devised iterative GWF algorithms that can take users to the ASE by exploiting local CSI at the users, resulting in self-optimized OFDMA. Results revealed that ASE achieves an average system sum-rate very close to that of ISB, a centralized near-optimal solution.
Table 1: Simulation results for the 2-user 2-subcarrier deterministic channel.

Equilibrium          NE                  OSE                 ASE
User k          User 1   User 2     User 1   User 2     User 1   User 2
p_k[1]           2.00     8.00        0       9.00        0       9.99
p_k[2]           8.00     2.00       9.99     1.00       9.99      0
Rate R_k        2.6439   2.6438     2.9386   3.4739     3.4594   3.4594


Table 2: Number of times for divergence in 1000 independent simulations.

(K,N)               (3,9)               (4,16)              (5,15)
Equilibrium     NE    OSE   ASE     NE    OSE   ASE     NE    OSE   ASE
x = 5            0      0     0      5      2     0      9      7     0
x = 0.8         78     58     0    310    251     0    386    323     0
x = 0.5         23     25     0    115    122     0    200    203     0
x = 0.2          0      0     6      0      0    39      0      0    48


Table 3: Average users' sum-rates for the 3-user 9-subcarrier interference channel.

Sum-rate     User 1     User 2     User 3
NE           13.9944    13.8966    13.8021
OSE          16.5275    15.5493    15.4743
ASE          32.6641    32.8518    33.0399


Table 4: The average per-user sum-rates for ASE and ISB.

(K,N)     (3,9)      (6,18)     (12,36)
ASE       32.8414    32.2201    31.6874
ISB       34.6674    35.5500    35.6335

Figure 1: A multi-user interference channel model.

Figure 2: The power game subsystem.

Figure 3: The sufficient convergence regions.

Figure 4: A four-ray Rayleigh fading channel model.

Figure 5: The cumulative distribution functions for the rate-ratios.

Figure 6: The average system sum-rate comparison for various x.